Cucker-Smale Flocking under Hierarchical Leadership 

Jackie (Jianhong) Shen * 
School of Mathematics, University of Minnesota, Minneapolis, MN 55455, USA 

Email: jhshenOmath.umn.edu; Tel. (612) 625-3570 



O 
O 

(N 

> 

O 

O 

in 

d 
cr: 



Abstract 

A mathematical theory on flocking serves the foundation for several ubiquitous multi-agent phenom- 
ena in biology, ecology, sensor networks, economy, as well as social behavior like language emergence 
and evolution. Directly inspired by the recent fundamental works of Cucker and Smale on the construc- 
tion and analysis of a generic flocking model, we study the emergent behavior of Cucker-Smale flocking 
under hierarchical leadership. The rates of convergence towards asymptotically coherent group patterns 
in different scenarios are established. 

The consistent convergence towards coherent patterns may well reveal the advantages and necessities 
of having leaders and leadership in a complex (biological, technological, economic, or social) system with 
sufficient intelligence. 

Key words. Bio-flocking, leaders, leadership, Cucker-Smale model, dynamic graphs, graph Laplacian, 
Fiedler number, convergence, perturbation, free will. 
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1 Introduction and Motivations 



1.1 General Background on Flocking 

Flocking, a universal phenomenon of multi-agent interactions, has gained increasing interest from various 
research communities in biology, ecology, robotics and control theory, sensor networks, as well as sociology 
and economics. 

(i) (Biology and Ecology) The emergent behavior of bird flocks, fish schools, wolf packs, elephant 
herds, or bacteria aggregations, for example, has long been a major research topic in population and 
behavioral biology and ecology 013110131111101111]. 

(ii) (Robotics and Control) The coordination and cooperation among multiple mobile agents (robots 
or sensors) have been playing central roles in sensor networking, with broad applications in military, 
environmental control, or various field tasks [111 122 j . 

(iii) (Economy and Languages) Emergent economic behavior, such as a common belief in a price 
system in a complex market environment, is also intrinsically connected to flocking. The emergence 
of a common language in primitive societies is yet another example of a coherent collective behavior 
emerging within a complex system [5J |S] • 

The present work can largely be categorized into the biology realm, and has been directly inspired by 
the recent mathematical works of Cucker and Smale jlEj, as the title suggests. Mathematical abstraction 
and rigorous analysis are more focused herein than actual biological or physical realizability or feasibility. 
As in physics, the study of idealized models can often shed light on various observed patterns in the real 
world, if such models can indeed catch the very essence. 

In biology and physics, the main goal of flocking study is to be able to interpret, model, analyze, 
predict and simulate various flocking or multi-agent aggregating behavior. Most existing works have been 
focusing on modeling and simulation ^JH3]. See, for example, the several important models investigated 
by Flierl et al. |3] (and their stochastic formulation). The more recent paper of Parrish et al. f5| also 
provides a comprehensive comparison among some major existing models and their governing variables (in 
the context of fish schooling). Quantitative analysis (as in [103^2) on ^ ne asymptotic rates of emergence 
and convergence, on the other hand, has been relatively rare. 

Mathematical efforts are gradually gaining strength in this multidisciplinary area. In the continuum 
limit, for example, there have been several recent efforts made by Bertozzi's group |2L)l I21| . in which 
global swarming (i.e., with densely populated agents) patterns are modeled and analyzed via suitable 
spatiotemporal differential equations. Discrete-to-continuum limits of interacting particle systems have 
also been investigated by the same group ^ |H] recently. Consistent and generic mathematical analysis 
has been very much in an early stage for many biological aggregation phenomena. In the current paper, 
following the recent remarkable works of Cucker and Smale jl[3] on flocking analysis, we attempt to make 
further extension along the same line. 

1.2 Cucker-Smale Flocking Model 

Given a flock of k agents (birds, fish, wolves, etc) labeled by i = 1,2, ...,k, the Cucker-Smale flocking 
model is specified by the nonlinear autonomous dynamic system: 



where Xi(t) and Vi(t) are 3D (3 dimensional, which is non-essential) position and velocity vectors at time 
t, x = (xi, . . . ,Xk) 6 (M 3 ) fe , and C(i) C {!,■■■ k} denotes the subgroup of agents that directly influence 



Xi(t) = Vi, 

*i(t) = EjeC(i) aij{x){ 




i = 1 :k,t> 



(1) 
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agent i. Furthermore, the connectivity coefficients ctij(x) are in the form of 

aij(x) = w(\xi — Xj\ 2 ), for some nonnegative weight profile w(y). 

In the current paper, by Cucker-Smale flocking model, we require as in [IJ [3] that the interaction weight 
function w{y) takes the form of: 

H H 

where H and (3 are two positive system parameters. One shall see that the two (= vs. >) make no 
difference for the analysis hereafter as long as w{y) is bounded and sufficiently smooth (also see We 
also must point out that this model has been put in a more general and abstract setting in the subsequent 
work of Cucker and Smale [5]. 

The look of the system © is not entirely new. For example, the 2D model studied by Vicsek et al. [23] 
is very similar in which Uj's share the same magnitude (or speed) while their heading directions O^s satisfy 
a similar set of equations. 

It is the particular choice of the connectivity coefficients in (J2J) that has made the Cucker-Smale model 
mathematically more attractive. Vicsek et al.'s model (in discrete time) |23| can be considered as taking 
the following cut-off weight function: 

w(y) = w r (y) = l y < r 2(y), C(i) = {1, . . . , k}, Mi. 

That is, two distinct agents x% and Xj interact if and only if they are within a distance of r > 0, which 
is assigned a priori and fixed throughout. The lack of long-range interactions has made the model very 
difficult to analyze. For example, the remarkable efforts of Jadbabaie et al. on emergence analysis 
avoided the actual dynamic dependence of a, t j on the configuration x, but instead, they focused on an 
altered setting that involves switching controls. 

The main results of Cucker and Smale [1] can be summarized as follows: when (3 < 1/2, the flock 
converge to some translating rigid structure (moving at a constant velocity) unconditionally, i.e., regardless 
the initial configuration; and when (3 > 1/2, the initial velocities and positions have to satisfy certain 
compatible conditions so that the entire flock can converge asymptotically. 

In summary, in the modeling and analysis of Cucker and Smale jlj [3], not only are the conditions 
for pattern emergence easily verifiable (i.e., by checking the initial conditions), but the role of long-range 
interaction is also clearly quantified. A smaller (3 signifies more intense long-range interactions among 
agents while a bigger (3 leads to much weaker ones. It has been shown that the critical exponent (3 C = 1/2 
is sharp and necessary. Previously, the connection between global pattern emergence and individual action 
rules has often only been observed experimentally or addressed empirically (Vicsek et al. |23| . for example, 
experimentally observed phase transition induced by population density p and random fluctuation rj. A 
higher density corresponds to more interaction among agents, or loosely, smaller (3 in the Cucker-Smale 
model.) 



1.3 Motivations and Main Results of Current Work 

In the current work, we investigate the emergent behavior of Cucker-Smale flocking under hierarchical 
leadership (HL), which will be defined in detail in the next section. 

Roughly, an HL flock is one whose members can be ordered in such a way that lower-rank agents are 
led and only led by some agents of higher ranks. As explained in more details in Section 2, for HL flocks, 
it is often either nontrivial or impossible to define a "fixed" inner product so that the Fiedler number of 
the associated (graph) Laplacian can be exploited, which is the key to the original work of Cucker and 
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Smale [1] and its subsequent generalization [5]. The current work thus takes a somewhat different approach 
in order to fully benefit from the characteristic structures of HL. 

As far as applications are concerned, there are two types of HL: passive and active ones. 

(A) (Passive/Transient Leadership) 

(A.l) (Disturbed Bird Flocks) In nature, certain types of leadership emerge in a transient and 
dynamic fashion and is often prompted by a specific environment. For a disturbed bird flock 
at rest, for example, the bird that first senses the approach of an unexpected pedestrian or 
predator often takes flight first, warns others, and first gains full speed, and consequently flies 
ahead of the entire flock and serves as a virtual leader. 

(A. 2) (Driving in a Traffic) During rush hours, each individual driver mainly manoeuvres according 
to the moving patterns of several cars right ahead in the visual field. Thus a chain of leadership 
naturally arises and extends linearly along the traffic. The leadership here is also prompted by 
the environment rather than being intrinsic among the stranger drivers. 

(B) (Active/Intrinsic Leadership) 

(B.l) (Governmental/Miltiary Hierarchies) Such hierarchical leadership is inherent in various 
social groups or structures, and often leads to more efficient management. Examples include, 
the chain of President-Governor-Mayor in the governmental system, and the chain of command 
from the Commander in Chief all the way down to a soldier. 

(B.2) (Social Animals) For some social animals such as monkeys, wolves, or elephants the group 
or social status of each member is clearly recognized by others and stably maintained, and 
guides the action of each individual in the hierarchies. (See also the recent work of Couzin et 
al. for non- hierarchical but "effective" leadership.) 

Our main results are the three theorems summarized below. All HL flocks are assumed to have Cucker- 
Smale connectivity introduced in the preceding subsection. 

(i) (Section 3) For an HL {k + l)-flock marching at a sufficiently small discrete time step h, under the 
similar classification scheme according to /3 < f3 c , = /3 C , or > (3 C , as in Cucker and Smale [U03, the 
velocities of the flock converge at a rate of 0(p 7 ^n k ~ 1 ), where the factor ph £ (0, 1) only depends on 
h, system parameters, and the initial configuration of the flock. The critical exponent is given by 
(5 C = l/(2k), instead of (3 C = 1/2 in the original work of Cucker and Smale [I]. (For a 2-flock (with 
k = 1) they are the same. For k > 1, the (5 C herein could be over restrictive and due to the deficiency 
of the particular methodology adopted.) 

(ii) (Section 4) For an HL flock under continuous-time dynamics, when (3 < 1/2, there exists some 
B > 0, such that the velocities of the flock converge at an exponential rate of 0{e~ Bt ). The constant 
B only depends on the system parameters and the initial configuration of the flock. (From the simple 
calculation on an HL 2-flock, (3 C = 1/2 is sharp in order to achieve unconditional convergence.) 

(iii) (Section 5) For an HL (k + l)-flock [0, 1, ... , k] of which the overall leader agent takes a free-will 
acceleration vq = f(t) (thus the system is no longer autonomous), as long as the overall leader behaves 
moderately so that f(t) = 0((1 + i) _At ) for some [i > k, the velocities of the flock will still converge 
at a rate of 0((1 + t)"^"*)) when < 1/2. (By (ii) where f = 0, (3 C = 1/2 is again sharp for 
unconditional convergence.) 

We also mention that Jadbabaie et al. also studied (under discrete time and working with Vicsek et 
al.'s orientation model [23]) the effect of a single leader moving at a fixed constant velocity. As mentioned 
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above, due to the difficulty in dealing with configuration-dependent dynamics, the authors switched to the 
study of an altered control problem (under the assumption of intermittent joint connectivity). 

In addition to the three main sections mentioned above, definitions and further detailed background 
will be introduced in Section 2. The conclusion is drawn in Section 6. 

2 HL Flocks, and Definability of Compatible Inner Products 
2.1 Flocks under Hierarchical Leadership (HL Flocks) 

Definition 1 (An HL Flock) A (k + l)-flock is said to be under hierarchical leadership, if the agents 
(birds, fish, wolves, etc.) can be labeled as [0,1, ... ,k], such that 

(i) aij = a a g en t i i ec [ by j / implies that j < i; and 

(ii) if we define the leader set of each agent i by 

£(*) = {j I oij > o}, 

then for any i > 0, C(i) ^ (non-empty). 
If so, the flock is called an HL flock. 




HL-flock HL-flock not an HL-flock 

Figure 1: Two examples of HL flocks and one example of a non-HL flock. The arrow i — > j means that 
agent i is led by agent j, or equivalently, ay > 0. Visually, it means that i looks up to j. 

Notice that the second condition requires that, except for agent 0, all the others must be subject to 
some leadership. On the other hand, the first condition implies that £(0) = 0. Thus agent is the overall 
leader (direct or indirect) for the entire flock. Figure ^ depicts the connectivity structure of two HL flocks 
and one non-HL flock. 

Proposition 1 (Connectivity Matrix of an HL Flock) A (k + I) -flock is an HL flock if and only if 
after some ordered labeling [0,1, ... ,k], the connectivity matrix K = (ajj)o<ij<fc is lower triangular, and 
for any row i > 0, there exists at least one positive off-diagonal element aij. 

Subject to convenience, in what follows a generic HL flock shall be denoted by either [0, 1, ■ ■ • , k] or 
[1, • • • , k]. As in Cucker-Smale |lj or Chung define the graph Laplacian matrix by 

L = D — K, D = diag(do, • • • , d k ), d i = S ^a ij . (3) 

j 
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Similarly, define the two (non-orthogonally) complementing subspaces of 

'A 

< : 

- w fc+i 



span < 



and 



r /o\ 

3'1 



,1 



Then it is easy to see that 



A = Ker(L), and M. k = Range(L) is L-invariant. 

Notice that the kernel assertion is directly guaranteed by the second condition of an HL flock, without 
which the kernel could be larger. 

From now one, as in Cucker and Smale [UE3, we shall only consider the restriction of the Laplacian on 
the reduced space M. k . Then it becomes nonsingular, and shall still be denoted by L for convenience. We 
also must point out that when applied to actual flocking, the reduced Laplacian L is applied to R 3fc (instead 
ofW k ) via the three spatial dimensions individually. 

2.2 Definability of Compatible Inner Products 

The general framework of Cucker and Smale [3J relies upon the Fiedler number of the Laplacian operator 
L, i.e., the smallest positive eigenvalue in the reduced space. In particular, it assumes the existence of a 
fixed inner product (-, •) such that 



(Lv, v) > £{v, v), 



for any v 6 



(4) 



Then an a priori lower bound on £ = constitutes the core to the convergence results established 
by Cucker and Smale [3]. Below we show, however, that such inner products could fail to exist for 
non-symmetric systems like HL flocks. 

Theorem 1 Consider the special HL (k + \)-flock [0, 1, ... , k] such that C(i) = {i — 1} for i > 0, and an 
instant when a^i-i = a for some fixed a > and any i > 0. Then the smallest eigenvalue is £ = a, but 
there exists no inner product (•, •) in the reduced space M fc such that 

(Lv, v) > a(v, v), v G R k . 

Proof. It is easy to see that the (reduced) Laplacian L is given by 



L = L a 



a 
—a a 








—a a 



kxk 



In particular, L a = aL\, and it suffices to prove the case when a = 1. If such an inner did exist, one would 
have 

(L\v, v) > (v, v), or (Jv, v) > 0, 

where J = L\ — Ld. Notice that Jv = (0, —z\, . . . , —Zk-i) T for v = (zi, . . . , z^) T . 
Let e's denote the canonical basis of M. k , and define 



G = (gij) = {{ei, ej))kxk 
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to be the associated Grammian matrix of the inner product. Then G must be positive definite. For any 
v = (zi, ■ ■ ■ , z k ) T ', one has 

(v, Jv) = v T G ■ Jv = (zi, . . . , z k )(gij)(0, -Zt,..., -z fc _x) T . 

Consider a special vector in the form of w = wt = (0, ...,0,l,i) T eR k . Then 

(w t , Jw t ) = (0, ...,0, l,t)(gij)(0, ...,0, 1) T = gu-i,k + 9k,kt. 

Notice that g kk = ( e fc> e fe) > 0. Then for any 

\dk-l,k\ 



t < 



one must have (wt, Jwt) < 0, which is contradictory. ■ 
Even when such compatible inner products do exist, for a general non-symmetric flock, they often 
depend on the configuration of the flock, and are thus time-dependent. This causes much inconvenience 
or a potential impasse for the Cucker-Smale approach in The efforts in the current work follow a 

different approach by exploiting the specific structures of HL flocks. 

3 Discrete-Time Emergence 

Recall that in the continuous time, the Cucker-Smale flocking model is given by 



x = v, 

v = —L x v, t > 0, 



(5) 



Where the reduced Laplacian L = L x is defined as in @ and both x and v are considered in the reduced 
(quotient) space. For a (k + l)-flock, both of them belong to M 3fc . 
Fixing a discrete time step h > 0. Define 

x[n] = x(nh), v[n] = v(nh), and L n = L^ni. 

(Note: the parenthesis-bracket correspondence follows the convention in digital signal processing |19j.^ 
Then the continuous-time system © is discretized to 



J x[n + 1] = x[n] + hv[n], 

\v[n + 1] = 5[n]w[n], n = 0, 1, • • • 

where S[n] = S h [n] = Id — hL n . 

For an HL {k + l)-flock [0,1,..., /e] , recall that the reduced Laplacian is given by 



L, 



di [n] 
-a 2 i[n] d 2 [n] 













-o fe i[n] -a k2 [n] 
For i > 0, since the leader set C{i) ^ 0, we have 



-a k ,k-i[n] d k [n]^ 



kxk 



(6) 



(7) 



<k[n] = ^aij[ 

3=1 



n\ 



(8) 
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Under the Cucker-Smale model, one has for any j G 

H 



dijln] 



1 + \xj[n] - Xi[n]\ 2 /2j 



(9) 



where x% denotes the original 3D position vector of agent % (and the factor 1/2 is for convenience). In the 
reduced quotient space, one has Xi = Xi — x$ G M 3 since the original configuration vector x, G ]R 3 ( fe+1 ) and 



the reduced representation x G M are connected via: 





x 




~x ~ 











xo 


X = 


X\ 




XQ 


+ 


Xl 


- x 




xq 




Xk 




x _ 




x k 


- x _ 




XQ 



+ 



As a result, for any pair i,j > 0, 

| *^ i I — I *^ i j I ^ ( | *^ i | I | & j | ) 2 1 X I • 

In combination with (JHJ) and Q, this implies that under the Cucker-Smale connectivity, 



di[n] > 



H 



(l + |x[n]| 2 )^ 



i > 0. 



(10) 



Assume, as in Cucker and Smale that under suitable initial conditions (according to whether /?<,=, 
or > (3 C = l/(2/c)), one has the uniform bound on the reduced position vector: 



\x[n]\ 2 <B h , for n = 0,1, 



(11) 



where is a constant bound depending only on h, the system parameters H and /?, as well as the initial 
configuration. (The existence of is a crucial ingredient of the proof and will be further addressed 
immediately after this main line.) Then one has, for any n > 0, and i > 0, 



di[n] > 



H 



0- + B h )f>' 



(12) 



Proposition 2 (Uniform Elementwise Bound on S) For < h < — — , S^ fn] > for any i,j, and 



max Sij[n] < 1 — hd* := ph, n = 0, 1, • 



(13) 



Proof. By definition, 



jSfn] = Id — hL n = L n 

ha kl [n] ha k2 [n] 
Under the condition on h, for the off-diagonals i > j, we have 



1-Wi[n] 
/ia2i[n] 1 — hd2[n] 













ha kk -i[n] l-hd k [n] 



kxk 



Sij[n] = haij <hH < — <-. 
For the diagonals, since ajj < H, we have d, < (fc — 1)-?/, and 



^[n] = 1 - hdi > 1 - h(k - 1)H > 1 



1 1 
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Therefore, 



max Sij [n] = max Su [n] = 1 — h min di < 1 — hd* , 

ij i i 



which completes the proof. ■ 
Next, our goal is to be able to control the growth rate of the matrix iteration: 

S[n]S[n — 1] • • • S[0], as n — > oo. 

Normally, such asymptotic behavior is investigated via the so-called joint spectral radius (e.g., Strang and 
Rota PHI) Daubechies and Lagarias [7], or Shen [T71 I18j ): 

lim ||5[n]5[n-l]---5[0]||», 

n^oo 

which is often too complex to be feasible since the matrices evolve and generally do not commute. The 
approach below resembles the Lebesgue Dominant Convergence Theorem in Analysis |13j . 



Definition 2 (Domination) A matrix B = (6y) is said to be dominated by another matrix C = (cjj) of 
the same size, if 

\hj\ < (Hj, for any 

If so, we write B -< C. 

Proposition 3 If B -< C, there exists some constant a, such that 

\\ B \\ < a\\C\\, 

where a only depends on the type of matrix norm adopted. 

Proof. All norms in a finite-dimensional Banach space are equivalent. Therefore, it suffices to establish 
the inequality under any special matrix norm. Consider the Frobenius norm: 

||£|| 2 = trace(££ T ) = J2 b ij ^ Z)4 = tr ace(CC T ) = ||C|| 2 , 

with a = 1 (the superscript T here denotes transpose). The general constant a resurfaces when another 
norm is used instead. ■ 

Proposition 4 Suppose Bi -< d, for i = 0, . . . , n. Then 

B n B n -i • • • Bq -< C n C n -\ ■ ■ ■ Co- 

The proof is trivial. Next we define a "complete" lower triangular matrix T = (tij)kxk by 



1, i > j; 

0, otherwise. 



Then the elementwise bound established in Proposition |2 directly implies the following. 
Corollary 1 Let ph = 1 — hd* as in Proposition^ Then 

S[n] -< p h T, and S[n - 1] • • • 5[0] -< p\T n , n = 0, 1, . . . 
Lemma 1 Let T = (tij)kxk be defined as above. Then \\T n \\ = 0(n k ~ r ). 



Proof. Denote by J the k by k lower triangular matrix whose nonzero elements are all l's and only 
distributed right below the diagonal, e.g., the 3 by 3 case, 



J 





1 
1 



Then it is easy to see that 



T = / + J + ••• + J 
Since J k = J k+1 = ■ ■ ■ = Okxk, one can also write 



k-1 



T=^J m . 

m=0 

More generally, for any t with \t\ < 1, one can define 

oo 

T(t) = ^t m J m = {I-tJ)~ 1 . 



m=0 



Then 



mr = {i- tj)-« = f; ( m n ) (-t) m j m = E ( n + ™ ') tmjr ' 



Letting t — > 1, we have 



T n = Jim T(t) n =Y,( n + Z ^ Jm ~* °( nk ~^ T - 

m =0 ^ ' 



The proof is then complete via Proposition 01 I 
Combining all the preceding results in this section, we have arrived at the following conclusion. 

Theorem 2 In the discrete-time Cucker-Smale model {j^ for an HL (k + 1) -flock , for any sufficiently 
small marching step h (as in Proposition^ and Cucker and Smale ^ there exists some ph £ (0, 1) 
under the conditions similar to ^ based upon (3 <,=, or > (3 C = l/(2k), such that 

fc-i\ 



S[n]-S[0]*O(j%n*- l )T. 



In particular, one has 



\v[n]\ < O(j%n k - 1 )\v[0]\, n -> oo. 
The order constant in O(-) only depends on the size k of the flock. 

We point out that the polynomial growth rate 0(n k ~ 1 ) (coming from T n in Lemma^) is characteristic 
of triangular HL flocks. A "full" system would make the approach here infeasible since 



"l •• 


■ l" 


n 


(A 


If ^ 


■ i) 




1 f - 


i) = k n - x 


i •• 


• i 


1 ■■ 


• 1 


kxk 


W 












i ■■ 


• i 



The exponential growth rate k n would thus overpower p 1 ^ and lead to an exponential blowup. 

Finally, we further address the important issue raised earlier in the proof concerning the boundedness 
condition in [x[n]| 2 < for all n. The existence of the convergence factor p^ has crucially depended 



10 



on such a bound Bh- On the other hand, the very existence of Bh, as we intend to show now, depends on 
Ph- This entanglement is characteristic of the nonlinear Cucker-Smale flocking model (as well as in Vicsek 
et al. [22] and Jadbabaie et al. and makes this type of models difficult to analyze. In the rest of the 

section, we introduce the brilliant approach of Cucker and Smale in unraveling such entanglement, which 
then genuinely completes the proof. 

Lemma 2 For any given integer k > 1, 

00 

J2t m m k ~ l < (k - - t)~ k , Vie [0,1). (14) 

m=0 

Proof. Notice that the equality holds when k = 1. Generally, for any t G [0, 1), 

- i)Ki -*)-* = (*- 1)1 £ ( )(-tr 

m=0 ^ ' 

m=0 ^ ' 

00 

= Y,( m + k-l)---(m + l)t m 

m=0 

oo 



m=0 

which completes the proof. ■ 
We now apply the self-bounding technique developed by Cucker and Smale in [IJ |S] to establish the 

2 

bound \x[n]\ < Bh that is crucially needed in the proof of Theorem |2j It also explains the origin of the 
critical exponent (3 C = l/{2k) and its role. 

We thus return to the step in (|11|) . This time, instead of assuming a priori that \x[n]\ < Bh for all 
n > 0, we proceed as follows. Fix any discrete time mark N, and define 

|z|* = max \x\[n], G argmax 0<n<Ar |x|[n], (15) 

0<n<N — — 

and similarly define 

d * = (T+MF (16) 

Thus could be considered as a "localized" version of Bh, restricted in any designated finite time segment 
[0,N]. 

Then all the earlier analysis and results hold up to the bounding formula on \v\[n] in Theorem 12 as 
long as one restricts n within [0, iV]. In particular for ph = 1 — hd*, 

\v\[n] <Apln k ~ 1 , n = 0,...,N, 

where the constant A only depends on k but on neither n nor N. 

Therefore, by the first equation of HL flocking in ©, for any n G [0,iV], 

n—l n—l 

\x\[n] < |x|[0] + + 1] — x[m]\ = \x\[0] + h \v[m]\ 

m=0 m=0 

n—l 00 



< |z|[0] +AhJ2 pTm*- 1 < \x\[0] +AhY J P n h 

m=0 m=0 

< \ x \[o] + (k -iy.Ah(i - Ph )- k . 



m m k-l 
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In particular, for n = N*, 

\x\* = \x\[N*\ < \x\[0] + (k — l)\Ah(l - p h )- k . 

Now that 

(i - Ph )- k = h- k d: k = (hH)- k (i + \ x \if k , 

one has the Cucker-Smale type of self-bounding inequality for the unknown 

\x\* < \x\[0] + (k — \)\Ah{hH)- k {\ + \x\lf k . 
Define Z = (1 + \x\lY' 2 . Then 

Z < 1 + < c + bZ m , (17) 

with c = 1 + \x\[0) and b = (k - \)\Ah{hH)- k . 

The rest of analysis then goes exactly as in Cucker and Smale jU [3] . Define 

F(z) = z - bz s - c, with s = 2f3k, and z > 0. 

Then when s < 1, the nonlinear function F(z) has a unique zero after which F stays positive. Since 
F(Z) < 0, one thus must have Z < z*, or 

1 3? I * ^ ^ ^ -2* • 

Now that z* only depends on c and 6, which are independent of the pre-assigned time mark N, we have 
obtained the uniform bound 

\x\[N] < \x\[N*\ = \x\* <z^ ViV = 0,1,--- . 

Thus Bh = z"l is the uniform bound needed in the proof of Theorem El This is the case when f3 < f3 c = 
1/(2*0. 

The other two cases when = j3 c and (3 > (3 C (corresponding to s = 1 and s > 1 for F(z)) can 
be analyzed exactly in the same manner as in Cucker and Smale jU[3]) an d will be omitted herein. In 
particular, in both cases, there will be sufficient-type of conditions on the initial configurations in order 
for the bound to exist. In the third case (3 > (3 C , there will also be more stringent upper bound on the 
time marching size h. We refer the reader to Cucker and Smale for the detailed analysis on F{z) in these 
two cases. This completes the proof of Theorem |^1 

In the next section, we investigate the emergent behavior of the continuous-time HL flocking using quite 
different methods. There, the results hint that the unconditional convergence range (3 G [0, l/(2/c)) just 
established might still be extendable onto [0, 1/2), as in Cucker and Smale [3]. Thus the critical exponent 
P c = l/(2k) might be further improved if other alternative approaches are to be investigated in the future. 

4 Continuous-Time Emergence 

Let [1, . . . , k] be an HL /c-flock in that order, connected via the Cucker-Smale strength with parameters (3 
and H as in P|. In this section, we establish the emergence behavior for the entire flock when (3 < 1/2, 
via the methods of induction and perturbation. The associated intuition is as follows. If the sub-flock 
[1, . . . , i — 1] almost reaches convergence, it shall look like a rigid one-body to agent i. Then [1, . . . , i — 1, i] 
is not far from a simpler two-agent flock. Our goal is to develop rigorous mathematical analysis to quantify 
and support this point of perspective. (In this section, we shall work with [1, . . . , k] instead of [0, 1, ... , k] 
due to the lack of advantage of introducing index 0.) 
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4.1 The Property of Positivity 



The general properties to be established in this subsection are characteristic of the Cucker-Smale flocking 
model. They could be useful for any future works on the model, on top of their roles in the proof of the 
main results of this section. 

Let Xi,Vi £ M 3 denote the 3D position and velocity vectors of agent i. Recall that the Cucker-Smale 
flocking model is given by 

l ii = Vi l (18) 
[vi = -{L x v)i = Eje£(i) a ij( x )( v j ~ Wi)> 



for t > 0, i = 1, . . . , k, and x = (xi,X2, ■ . . , x&) € R . The Cucker-Smale connectivity strength is specified 
by 

a,ij(x) = a, j £ £>{i)- 

(As mentioned earlier in the Introduction, changing "=" to ">" does not affect the subsequent analysis 
as long as Oy(x)'s are bounded and sufficiently smooth.) Given a solution (x(t),v(t)) to the continuous 
Cucker-Smale model (|18|). we write for convenience 

Oij(t) = aij(x(t)), and L t = L x{t y 

Let r\ = (rji,r]2, ■ ■ ■ T]k) T £ K fe be scalars, and consider the following system of ordinary differential 
equations: 

i) = -L t r], t>0, given rp = i]\ t=0 - (19) 

Componentwise, we have 

i]i= ^2 Oij(t)(i1j -Vi), i = l,...,k. (20) 

je£(0 




Theorem 3 (Positivity) Suppose m > for i = 1, . . . , k. Then for all t > and i, rjiit) > 0. 
Proof. For any agent i in the flock, define 

£°(<) = {.}, 

£ m (i) = £(£ m ~ 1 (i)), all m-th level leaders of i, and (21) 
[jC] (i) = C°(i) U £ 1 (i) U C 2 {i) ■ ■ ■ , all leaders of i, direct or indirect. 
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Then it is easy to see that the system (|2U|) restricted on [C](i) is always self-contained, i.e., (to \ j G [£](i)) 
is not influenced by any variables in (rjj \ j [£](*)) (but certainly not vice versa). 

For convenience, we shall call the restriction of the system (|19|) or (|20|) on the sub-flock [jC](i) the [C](i)- 
system. Then it suffices to establish the theorem for each [C](i) system. In Figure [2 we have sketched an 
example of the hierarchies of leaders of a given agent i. 

Suppose otherwise that the theorem were false on an [C] (i)-system for some particular agent i. There 
would exist some j G and i > such that rfj(t) < 0. Define 

= inf{t > | there exists some j G [>C] (i), such that rjj(t) < 0}. 

Then < i* < i < oo, and we claim additionally the following. 

(i) For any j E [-£](«), > 0. 

(ii) There must exist some j G [£](£), and a sequence of moments (t n ) such that t n > t*, t n — > t* as 
n — > oo, and rjj(t n ) < 0. 

(hi) There must exist some G [£](£), such that r/j t (t*) > 0. 

(i) and (ii) result directly from the definition of t*. Suppose otherwise (hi) were false. Then in particular, 
for any j G [£](j), one must have r?j(t*) = by (i). Consider the [£](j)-system after t*: 

= Yl a fl^)^i ~ ^')' 3 G [^](i)> 1 > 
iec(j) 

Since this is a homogeneous system with zero initial conditions at t = t*, by the uniqueness theorem of 
ODEs (e.g., [TU]). the solution to the [C] (j)-system must be identically zero: i]j(t) = for any t > t* and 
j G [>C](j). Now that j G [£](j), one must have r)j(t) = for all t > t*, which contradicts to Property (ii). 
Thus (hi) holds. 
Define 

to = min{TO > | there exists some G C m (i), such that l]j*(t*) > 0}. 

Property (i) and (hi) imply that < to < oo. Then by iteratively differentiating the £(j)-system, one can 
easily establish: 

Vj(t*) = V-(t,) = ■■■ = ryf - X) (t*) = 0, > 0, 

which contradicts to Property (ii). Thus the theorem must hold and the proof is complete. ■ 
The most important consequence is the following bounding capability. 

Theorem 4 (Boundedness of Velocities under Evolution) The Cucker-Smale model M8\) has the 
following closedness properties. 

(i) Suppose Q is a convex compact domain in M. 3 , and for any agent i, initially Vi(t = 0) G O. Then for 
any t > and i, Vi(t) G Q. 

(ii) In particular, let Dq = maxj \vi(t = 0)|. Then \vi(t)\ < -Do f or all t > and i. 

Proof. Since the closed ball Bd q (0) in 1R 3 is convex and compact, (ii) is implied by (i). It suffices to 
establish (i). 

For any unit vector n G S 2 , and given vector o£l 3 . We first claim that if 

n ■ (vi - a)\ t=Q > 0, Vi, 
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then n ■ (vi(t) — a) > remains valid for all t > and i. To proceed, define rji = n ■ (vi — a). 

77 = n ■ Vi 



n 



it ■ I ^ ajj(t) - a) - (uj - a) 
V'e£(i) 

= -{L t r))i. 

Then by the preceding theorem, the claim is indeed valid: 77$ (i) > for alH > and %. 

For any compact convex domain f2, let jj : S 2 — > M 3 be its support function, so that for any unit 
direction n € S 2 , a = p(n) has the property that a £ dQ and the closed flat half-space 

vr„,_„ = {x g R 3 I (-n) • (s - a) > 0} 

contains fl When the domain is convex but not strictly convex, p(n) could be a set of points, which 
however does not influence the argument herein. Furthermore, we have 

n = n 

nes 2 

Since each half-space has just been shown invariant under the Cucker-Smale evolution, we conclude that 
f2 must be invariant as well under the evolution, which completes the proof. ■ 

4.2 Perturbation and Induction 

We now first prepare a lemma. Together with the boundedness property just established above, it facilitates 
the later analysis on the emergent behavior of HL flocks. 

Lemma 3 Suppose x(t),v(t) G R 3 (which could be considered as X2 — x\ and V2 — v\ for a 2-flock), and 
satisfy the perturbed 2-flock system parametrized by some T > 0: 

(* = ! ' (t > (22) 
y v = -a T (x, t)v(t) + e T (t), t>0. 

Assume in addition that the following conditions hold. 

(i) a T (x,t) > H 2 g , with (3 < 1/2. 

(1 + |x| z J p 

(ii) et G R 3 , and 

\e T {t)\ < ae~ b{t+T) \ for some r, G (0, 1]. (23) 
(in) \v{t)\ < D for all t > 0, and \x \ < R + D T. 

Here H, (3,a,b,r), Dq, and Rq are given constants independent of T . Let (x T (t) , v T (t)) denote the depen- 
dency on T. Then 

\v T (T)\<Ae- BT(1 - 2mAv ~ , (24) 

where rj~ = n — 5 for any small but positive 5 when r/ < 1, and rf~ = 1 when rj = 1, and A and B are 
two constants only depending upon H, (3,a,b,rj~ , Dq, and Rq (but not T). The notation a A b represents 
min(a, b). 
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Remark 1 We first make two comments on the conditions. 

(1) The all-time bound \v(t)\ < Dq seems very stringent, but is now natural by Theorem^Jin the preceding 
subsection. 

(2) As outlined in the beginning of the current section, the lemma will be applied during the induction 
process going from the sub-flock [1, . . . , i — 1] to [1, . . . , £]. To agent i, the perturbation factor £t(£) 
comes from the exponentially small dispersion of the leading sub-flock [1, . . . , % — 1] from reaching exact 
emergence. 

We now proceed to the proof of Lemma El 
Proof. From the equation for v, we have 

(v, v) = -a T (x,t)(v, v) + (v, e T (t)), or 

M • M* = \ (\ v \ 2 ) t = ~ a T\v | 2 + (v, e T (t)). 

Assuming that v is never identically zero on any non-empty open time interval (noticing that the oppo- 
site scenario trivializes the lemma on any such intervals and the following argument only needs a minor 
modification), one has 

\v\t < — clt\v\ + \et\ 

by the conditions (i) and (ii). By x = v and (hi), 

\x\ < \xq\ + / \v\(t)oIt 
Jo 

<R + D T + D t = R + D (t + T). 

As a result, 

\v\t < M + ae- b ^\ 

Then by the Gronwall-type integration, 

_ rt h dr _ rt h ds 

\v(t)\<\v \e (^(Ro+Do^+T)) 2 )" +a e -6(r+T)" . e Jr (i+(R Q +D (s+T))2f dj . 



<D -e (i+(Ro+D (t+T)) 2 f + ^L T l- Ve -bT^ 

br] 

We denote v (t) by v T {t) to indicate its dependency on T. Then 

br] 

< DQe -H{H,R M )T^ + c(0j 6j r)e -bTn- (when T > 1} 

where the two constants A and B are independent of T. Also notice that when r/ = 1, the monomial factor 
T 1_?? = 1 and the lowering from rj to r[~ is unnecessary in the first line. Finally, since |f T (t)| < -Do by 
the given conditions, by suitably increasing A, the condition T > 1 in the last second line can actually be 
removed. This completes the proof. ■ 
We are now ready to state and prove the main theorem. 
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Theorem 5 (Convergence of an HL Flock) Let [1,2,..., k] be a Cucker-Smale flock under hierarchi- 
cal leadership with (3 < 1/2. Then for some B > 0, which depends only on the initial configuration and all 
the system parameters, one has 

max \ Vi (t) - Vj(t)\ = 0(e' Bt ), t > 0. (25) 

l<i,j<k 

Proof. We prove the theorem by induction on the sub-flocks, from [1, — 1] to [1, . . . , I]. 
First we show that the theorem holds for a 2-flock [1, 2]. By definition, the leader set C(2) is nonempty 
and has to be C(2) = {1}, i.e., 021 > 0. Let x = X2 — x\, and v = V2 — v\. Then 

{x = v 
V = V2 — Vl = i>2 = a2l(v\ — V2) = -021V. 

Here 021 = cl2\{x) = : — prr-s^ with (3 < 1/2. Then Cucker and Smale's analysis in ,4 still applies 

(1 + \x\ z )P ' ' 

Bt\ 



directly, and \v(t) \ = 0(e ) for some B > 



X = X, - Xj , ^ \3 



\ £1 

I 



Figure 3: The induction process from [1, ... ,1 — 1] to [1, ... ,1—1,1] reduces the /-flock system to a perturbed 
2-flock system. 

Assume now that the theorem holds for the sub-flock [1, . . . , I — 1], we intend to show that it must be 
true for [1, ... ,1 — 1,1] as well for I > 2. As a result, the main focus shall be the agent I. 
By induction, there exists some b > 0, such that 

max \vi(t) - Vj(t)\ = 0(e~ bt ), t -> 00. (26) 
i,je{i,-,l-i} 

Define the average velocity (of the direct leaders of agent I) by 

= d ^ Vi ^ } with dl = 

iec(i) 

Then for any j G C(l), 

\vj(t) - vt(t)\ < ^ E \°S ~ «<l = °( e_W )' ( 27 ) 

by the induction assumption. Similarly, define 

1 

di 



di 



Xl(t) =— J] X i( t )' and X ( t ) = X l( l ) ~ V ( t ) = - ^(*)- 
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Then x = v , and 

dvi v-^ i \ 



dm (28) 



= E ai i " ~ w ') + E ' ^ " d b (]l ■ 

jec(i) jec(Q 

* v ' 

e(t) 

Since each vi (i € C{1)) is the linear combination of some (yj — Vi)'s with j € £(i) C {1, . . . , I — 1}, by ()26[). 
one must have 

dt y ' 

Similarly, due to (|2T)) and the boundedness of ay's, one has 

| a lj -(v j -v l )\=0(e- bt ). 

In combination, we conclude that 

\e(t)\ < ce~ bt , t > 0, for some c> 0. (29) 

On the other hand, define 

a = ^ a H = — 

+ \Xj — xA 

Then ()28|) simply becomes 



E a ^ = E n m g _ , 2 yr ( 3 °) 



?) = — av + e. (31) 



Define q(s) = with s > 0. Then q(s) is convex, and 

v 1 (1 + s)/ 3 ~ w 



As a result, when Sj = \xj — xi\, 



\ E ^)>a[j E s i • 



E n + \ x H _ x .\2)p ^ d n ; 70- ( 32 ) 

By the least-square principle, 

X 

— ^2 \xi - Xj\ 2 = \ x\ - xi | 2 + — ^2 \ x j-xi?i ( 33 ) 
' ieC(i) 1 jeC(l) 

since xi is the center or mean of {xj \ j £ C(l)}. By the induction assumption on the emergence of 
[1, — 1] D jC(Z) , there exists some M > 0, such that 

j £ N-^| 2 <^-1- (34) 
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Combining Eqn.'s PU|) through (|34[). we have 

d H H 

Q = t] * (M+W " (1 + W (35) 

where the updated constant = H(H,di, M, 0). (Notice that the notation a(x,t) summarizes all the 
influence from {xj \ j € C(l)} into the t-variable.) 

The combination of (|29|). (|31|). and 1)35(1 leads to the reduced system: 



di ' a; 

je£(/) ie£(l) 



:(T)-x(0)| < |ar,(T)-x,(0)| + j £ 1^(^-^(0)1 

' jec(i) 

<^ T +i V ^T = D T. 
~ 2 di ^ 2 u 



As a result, 



(36) 



\ x = v 

\v = —a(x, t)v + e(t), 

with a(x,t) > - j — r-j, and |e(t)| < ce~ ht . In order to apply LemmaOl further define 

(l + \x\)P 

Dq = 2 max \vi(t = 0)|, and R$ = 2 max = 0)|. (37) 

l<i<fe l<i<fc 

Then by Theorem 0J we have 

\vi(t)\ < : y, V«andi>0. 



Consequently, 

;(t)| < ^ E K' flo = D °" (38) 

Similarly, for any T > 0, 



|x(T)| < \x(0)\+D T 

^ N(o)l + 4- E M°)I + A)T 

* iec(0 (39) 

To conclude, for any T > 0, if we define 

x T (t) =x(t + T), v T (t) = v(t + T), a T {x T ,t) = a(x T ,t + T), and e T (t) = e{t + T), 

then, 

( ■ T T 

J X = V 

yv T = -a T (x T ,t)v T + e T (t), t>0, 

and all the three conditions in Lemma|3]are satisfied (with r/ = 1). Therefore, there must exist two positive 
constants A and B, such that for any T > 0, 

\v(2T)\ = \v T (T)\ < Ae- §Ta - W = AT**-". 



19 



Since T is arbitrary, we thus must have, after adjusting the constants, 

\v(t)\ < Ae~ Bt 13 , t > 0, for some constants A and B. 
Moreover, since (5 < 1/2 by assumption, one then must have 

;-oo 

/ \v(t)\dt < CO, 

Jo 

which in return implies that there exists some constant M > 0, such that 

\x{t)\ < M, t>0. 

Then by repeating the similar calculation in the proof of Lemma El assisted with this new constant bound 
\x\ < M instead of \x\ < Rq + Do{t + T) there, one arrives at: 

\v(t)\ < A'e~ B '\ (since rj = 1), 

for two positive constants A' and B' independent of t. Combined with the induction base (|26|). we thus 
conclude that the theorem must hold true for the sub-flock [1, ... ,1 — 1,1] with the exponent coefficient 
B = B' A b. This completes the proof. ■ 



5 HL Flocking Under a Free-Will Leader 

In this section, partially inspired by the preceding perturbation methods, we investigate a more realistic 
scenario when the ultimate leader agent (in an HL flock [0, 1, . . . ,k]) can have a free-will acceleration, 
instead of merely flying in a constant velocity 

The following phenomenon is not uncommon near lakes, grasslands, or any open spaces where a flock 
of birds often visit. When the flock is initially approached by an unexpected pedestrian or a predator from 
a corner on the outer rim, the bird which takes off first (and alerts others subsequently) generally takes a 
curvy flying path before it reaches a stable flying pattern with an almost constant velocity. Such a bird 
gains the full speed fast, flies ahead of the entire flock, and serves as a virtual overall leader. 

For an HL flock [0, 1, . . . , k], in addition to the Cucker-Smale system 

Ui = v i (t) m 
[ v i = Eje£(i) aij{x)(vj(t) -Vi{t)), i>0, 

we now also impose for the ultimate leader agent 0: 

h =vo (4i) 

\v = f(t), t>0, 

coupled with a given set of initial conditions. For convenience, we shall call f(t) the free-will acceleration 
of the leader. In combination, the new system is no longer autonomous. 
The main goal of this section is to establish the following theorem. 

Theorem 6 Suppose an HL (k+1)- flock [0,...,k] with a free-will leader satisfies both \4-Ujj and \4l\l , 
with the Cucker-Smale connectivity strength of (3 < 1/2. In addition, assume that the leader's free-will 
acceleration satisfies 

\f(t)\ = 0((1 +t) _M ), with some exponent fj, > k. 
Then the flock still has the following emergent behavior: 



max 

0<i 



ax \ Vl - Vj \( t ) = 0((l + t)-^- k A 

,j<k \ J 
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Remark 2 We first make two comments regarding why one should expect to put some regularity conditions 
on the leader's behavior in order for a coherent pattern to emerge asymptotically. 

(1) Intuitively if the leader keeps changing its velocity substantially it will be more difficult for the entire 
flock to follow and behave coherently. An extreme example is a flock with a drunken leader which 
flies in a Brownian random path. Then the entire flock cannot be expected to synchronize with the 
unpredictable motion of the leader instantaneously. 

(2) In the theorem, the decaying constraint fi > k depends on the size k of the flock. Thus qualitatively 
speaking, it requires the leader to exert less free will when the flock is larger, in order to lead a coherent 
flock asymptotically. Consider the special hierarchical leadership under a linear chain of command: 

k^k-1^ ► 1 -»• 0. 

The tail agent k has to go through all the k intermediate stages to sense any move that the leader is 
making. Thus intuitively, there will be a long time delay in between, and the leader has to be tempered 
enough to allow its distantly connected followers to respond coherently. 

We first prepare a lemma that is similar to Lemma 131 Since the new non-autonomous system does not 
necessarily have the positivity property, we take a slightly different approach. 



Lemma 4 Let x,v,g G M 3 , and satisfy 



x = v(t) 

v = —a(x, t)v(t) + g(t). 



Suppose that 



a(x,t) > — : — jg— for some j3 < 1/2, and, 

\g(t)\ = O ((1 , with some constant rj > 1. 

Then, \v(t)\ = 0((1 + i) - ^ -1 )) with the order constant only depending on the initial conditions x(t = 
0),v(t = 0), and H, (3, and r\. 

Proof. From the second equation, one has 

\v\ ■ \v\ t = = (v, v t ) = -a(v, v) + (v, g) < -a\v\ 2 + \v\ ■ \g\ 

Assume that v does not vanish identically on any non-empty open intervals for the same reason as in the 
proof of Lemma 01 Then one has 

\v\t < ~ a \ v \ + t > 0. 

Fix any time T > 0, and define 

H H 

= sup|x|(t), and a* = inf = (42) 

Then one has 

\v\ t < -a*\v\ + \g\, te[0,T]. (43) 
Since a* is constant, integration yields 

\v\{t) < M(0)e- a **+ f \g\{T)e- a * (t ' T) dT. 



o 
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In particular, for any t < T, 

f't f'OO 

\v\(t) < \v\(0) + / \g\(r)dT < \v\(0) + / \g(r)\dr := A Q . 
Jo Jo 

(Since rj > 1 by assumption, the integral of \g\ is finite.) Now that Aq is independent of the time mark T, 
we conclude that the last upper bound must hold for any t > 0: \v\(t) < Ao,t > 0. Therefore, from the 
first equation x = v(t), one has 



\x\(t) < \x\(0) + f \v\{r)dT < B + A Q t, t > 0, 
Jo 



'o 

where Bq = \x\(0). In particular, for any time mark T > 0, the quantities in (|42|) are subject to: 

H 



\x\* < B + A T, and a* > 



[1 + (B + A T) 2 ]P- 

We then go back and integrate the inequality (|4S[) again, but from T/2 to T this time: 

rT 
IT/2 



\v\(T) < \v\(T/2)e~^ + f \g\{T)e- a * {T - l) dt 

Jt/2 

HT/2 fOC 

< A e [i+(s +a t)2]^ + / | 5 |(t)dt 

JT/2 

poo 
JT/2 

= A o e-^ 1+T ^ 20 +o((l+T)-^ 



Since /3 < 1/2, we conclude that 

\ V \(T) =0({l+T)-^-V 



where the constant in O(-) is independent of T. Since T is arbitrary, the lemma is established. ■ 
We are now ready to prove Theorem H3 Details on some similar calculations will be directed to the 

proof of Theorem [21 

Proof. It suffices to prove the following more general result: 

max \ Vi - vMt) = O ( (I + t)-^- l A , t > 0, (44) 

0<i,j<l V / 

for any sub- flock [0, 1, . . . , I] and I > 1. 

When I = 1, define x = x\ — xq and v = v\ — vq. Then x = v, and 

v = vi-v = a 10 (v - vi) - f = -a w v - f. 

By the definition of an HL flock, C{1) ^ 0, and it has to be agent 0, implying that aio is subject to the 
Cucker-Smale formula. Then by the preceding lemma (with r/ = fi), one has 



\v\(t)=0[(l + tr { ^ 1) ), 



and (jSJ) holds. 

Suppose now that Q44|) is true for the sub-flock [0,1, ... ,1 — 1] with 2 < I < k, so that 



max \ Vi - Vj \(t) = O ((1 + t)-^ l+1 A . (45) 
0<i,j<l — i \ / 
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As in the proof of Theorem |S1 define the average features of the direct leaders of agent / by: 

71 xj, and vi = — y~] v j: d t = #£(l), 
1 J6£(0 ' j&C(0 

and x = xi — xi and v = v\ — v\. 

Then as in the proof of Theorem |SJ one has x = v and 



v = —a(x,t) ■ v + gi(t), with 

dt ' 



gi(t) = a H ■ ( v j-vi) ~ ^ 

a(x,t) = oj 3 -(a;j - Xj). 
j'e£(Q 

We first estimate Since \aij\ < i? and C [0, 1, . . . ,1 — 1], by the induction assumption (|45jl. the first 
term in gi must be of the order 0((1 + with rj = fi — I + 1. For the remaining second term in let 
1og£(0 denote the logical variable which is 1 when agent belongs to £{l), and otherwise. Then 

dVl 1 y-^ . 11^. 



Now that = f(t) = 0((1 + i) M ), and each t>j with j £ £(/) \ {0} is some linear combination of (u s — i>j) 
with s's in £(j) C [0, 1 , . . . , Z — 1] . Thus by the induction assumption (|45|) . one must have ^ = 0((l+t) _,? ) 
with 77 = fj, — I + 1. 

We now estimate a(x, i). Since /i > fc by the given condition, we have fj, — I + 1 > k — Z + 1 = 1. As a 
result, by the induction assumption on the sub-flock [0,1, ... ,1 — 1], for any i, j < I — 1, 

\xi - Xj\(t) < \xi - xj\(0) + / |«i - Vj\(r)dr 



o 



\(°) + f + dr < oo, Vt>0. 



Therefore the boundedness property in (|34[) still holds, and the same calculation in the proof of Theorem[5] 
leads to 

H 

a{X,t) ~ (l + |x| 2 )/3' 

for some constant # = H(H, di,j3, f, initial conditions of [0, — 1]). 

Combining the estimations on gi and a, one sees that x[t) and v(t) satisfy a perturbed system as in 
Lemma 0] with rj = — l + Therefore, by Lemma 01 

h - dm = \v\(t) = O ((1 + ij-fo- 1 )) = o ((i + 1)-^ . 

Now that by the induction assumption, for any j < I — 1, one must have 

|^ - fi,|(t) = O ((1 + , since |^ - Wi | = O ((1 + t)~ ( "~ Z+1) ) , V i G £(/). 

Therefore, for any j ' < / — 1, 

< h-«i| + K-v2l = 0((i + *) _(/1_0 )- 

This completes the proof of (|4"5)) . and thus the entire theorem. ■ 
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Corollary 2 Under the same statements as in the preceding theorem, suppose \i > k + 1, then there exists 
a constant configuration (djj)o<j,j<fc with dij € M 3 , such that 

lim (xi(t) - Xj(t)) = dij, < i, j < k, 
and the convergence rate is O ((1 + t)~ ( - p '~ k ~ 1 ^ . 

6 Conclusion 

In this paper, we have investigated the emergent behavior of Cucker-Smale flocking under the structure of 
hierarchical leadership (HL). The convergence rates are established for the general cases of both discrete- 
time and continuous-time HL flocking, as well as for HL flocking under an overall leader with free-will 
accelerations. 

In all these cases, the consistent convergence towards some asymptotically coherent patterns may reveal 
the advantages and necessities of having leaders and leadership in a complex (biological, technological, 
economic, or social) system with sufficient intelligence and memory. 

Our future work shall focus more on extending the results herein onto other flocking systems or lead- 
ership structures, a few of which have been mentioned in the Introduction. 
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